§1. INTRODUCTION/SUMMARY.

The basic formulation of the problem proposed here was introduced and
studied from a Bayesian viewpoint in [5]. It involves a set of
independent random lifetimes $X_1,\ldots,X_n$ whose complete observation
may be prevented by censoring on the right by another set of independent
variables $Y_1,\ldots,Y_n$. Thus the observable data are
$\{(Z_i,\delta_i): 1 \le i \le n\}$ where, for each $1 \le i \le n$,

(1.1)  $Z_i = X_i \wedge Y_i$ and $\delta_i = 0$ or $1$ according as $X_i > Y_i$ or not.

This is the so-called random censorship model, which has received
considerable attention among researchers in the past several years.
However, as observed in [5], there is a broad class of experiments
pertaining to clinical trials and reliability in which the $Z_i$ are
observed sequentially, and cost and/or time considerations often entail
termination of experimentation before all the $Z_i$ have been observed.
For example, a study may be curtailed at the $k = k(n)$th smallest order
statistic $Z_{(k)}$, $1 \le k \le n$, and then, in effect, the investigator
has at his disposal only the data

(1.2)  $\{(Z_{(i)},\delta^*_i): 1 \le i \le k(n)\}$ with $\delta^*_i = 1$ or $0$ according as $Z_{(i)}$ is a true lifetime or a censoring time.
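The passage from the complete data (1.1) to the curtailed data (1.2) can be sketched in a few lines. This is only an illustration: the function name `progressively_censored_sample`, the exponential choices for the lifetime and censoring distributions, and the seed are assumptions made for the example and do not come from [5].

```python
import random


def progressively_censored_sample(x, y, k):
    """Return the k smallest pairs (Z_(i), delta*_i) as in (1.2).

    x, y : equal-length sequences of lifetimes and censoring times.
    k    : stopping number, 1 <= k <= n.
    """
    n = len(x)
    if len(y) != n or not 1 <= k <= n:
        raise ValueError("need len(x) == len(y) and 1 <= k <= n")
    # (1.1): Z_i = min(X_i, Y_i); delta_i = 0 if X_i > Y_i (censored), else 1.
    pairs = [(min(xi, yi), 0 if xi > yi else 1) for xi, yi in zip(x, y)]
    # Order by observed time and keep only the first k responses.
    pairs.sort(key=lambda pair: pair[0])
    return pairs[:k]


if __name__ == "__main__":
    random.seed(0)  # illustrative seed only
    n = 10
    lifetimes = [random.expovariate(1.0) for _ in range(n)]  # assumed F
    censoring = [random.expovariate(0.5) for _ in range(n)]  # assumed G
    for z, d in progressively_censored_sample(lifetimes, censoring, k=7):
        print(f"{z:.3f}  {d}")
```

Only the ordered pairs returned by the function are available to the investigator; the remaining $n - k(n)$ units are known merely to exceed $Z_{(k(n))}$.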

Statistical procedures based on data of the type (1.2) are referred to
as progressively censored schemes (Sen et al. (1973, 1978)). Thus,
instead of prolonging observation until responses from the entire
sample have been recorded, experimentation may be stopped at an
appropriate intermediate stage, with an attendant reduction in cost
and time.


In [5] the Bayes estimator $\hat F_{k(n)}$ of the survival function

(1.3)  $F(t) = P[X_1 > t \mid F]$, $t \ge 0$

has been obtained from the data (1.2) under squared-error loss when $F$
follows a Dirichlet process prior. We noted there that $\hat F_{k(n)}$
yields as special cases the Ferguson (1973) estimator of $F$ whenever all
of $X_1,\ldots,X_n$ are observable, the usual empirical survival function
of $X_1,\ldots,X_n$, and the estimators given by Kaplan and Meier (1958)
and Susarla and Van Ryzin (1976). Our objective in this paper is the
investigation of the weak convergence of $\hat F_{k(n)}$. We shall show
that the process $\{n^{1/2}(\hat F_{k(n)}(t) - F(t)): 0 \le t \le T\}$,
$T < \infty$, has exactly the same asymptotic properties as the process
$\{n^{1/2}(\hat F_n(t) - F(t)): 0 \le t \le T\}$ based on the complete
sample, provided $\liminf n^{-1}k(n) > 1 - P[Z_1 > T \mid F]$ and the
sequences $\{X_i: i \ge 1\}$, $\{Y_i: i \ge 1\}$ are each independent and
identically distributed (iid) and independent of each other. In view of
this it is not unreasonable in practice to terminate experimentation at
the $k(n)$th stage when $n^{-1}k(n)$ is close to unity. For example, if
$k(n) = n - c \log n$ with $c > 0$ large, substantial savings in time and
cost can be achieved without sacrificing asymptotic accuracy. This yields
a very cost-effective procedure, especially when per-unit observational
costs are prohibitively high.
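To see how quickly $n^{-1}k(n)$ approaches unity for the rule $k(n) = n - c \log n$, one may tabulate it numerically. The sketch below is illustrative: rounding $k(n)$ down to an integer and clipping it to $\{1,\ldots,n\}$ are assumptions added here, and the function names are hypothetical.

```python
import math


def stopping_number(n, c):
    """k(n) = n - c*log(n), rounded down and clipped to {1, ..., n} (assumed rounding)."""
    k = math.floor(n - c * math.log(n))
    return max(1, min(n, k))


def stopping_fraction(n, c):
    """The ratio k(n)/n, which tends to 1 as n grows for any fixed c > 0."""
    return stopping_number(n, c) / n


if __name__ == "__main__":
    for n in (100, 1000, 10000, 1000000):
        print(n, stopping_number(n, 10.0), round(stopping_fraction(n, 10.0), 6))
```

Even with a fairly large constant such as $c = 10$, the number of units left unobserved, $n - k(n)$, grows only logarithmically in $n$.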

The substantive material of the paper appears in the next two
sections. Section 2 introduces some notation and assumptions, together
with a discussion of our estimator and its special cases. The proof of
the weak convergence is detailed in Section 3, and we make a few
concluding remarks in the final section.


§2. Preliminaries

Consider $n\,(\ge 1)$ units under surveillance, for which we record
either the time to decrement (survival time) $X$ or its competing
censoring time $Y$, up to and including the $k_n$-th response,
$k_n \in \{1,\ldots,n\}$. Suppose the survival distribution $F$ of $X$ is a
Dirichlet process with parameter measure $\alpha$ on the Borel subsets of
the positive real line $R^+ = (0,\infty)$ and, given $F$, the survival
times $X_1,\ldots,X_n$ are iid with distribution function (d.f.) $1-F$.
Furthermore, when the corresponding censoring times $Y_1,\ldots,Y_n$ are
iid with continuous right d.f. $G(t) = P[Y_1 > t]$ on $(0,\infty)$ and
independent of $(F,X_1,\ldots,X_n)$, we have demonstrated in [5] that
under weighted squared-error loss the Bayes estimator $\hat F_{k_n}$ of
the survival curve $F$ of (1.3) can be written

(2.1)  $\hat F_{k_n}(t) = B_n(t)\,W_n(t)$


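where

(2.2)  $B_n(t) = \dfrac{\alpha(t) + N^+(t) + (n-k_n)[Z_{(k_n)} > t]}{\alpha(0) + n}$,

(2.3)  $W_n(t) = \prod_{j=1}^{k_n} \left\{ \dfrac{\alpha(Z_{(j)}) + N^+(Z_{(j)}) + (n-k_n) + 1}{\alpha(Z_{(j)}) + N^+(Z_{(j)}) + (n-k_n)} \right\}^{[\delta^*_j = 0,\ Z_{(j)} \le t]} \left\{ \dfrac{\alpha(Z_{(k_n)}) + (n-k_n)}{\alpha(Z_{(k_n)})} \right\}^{[Z_{(k_n)} \le t]}$,

with the abbreviation $\alpha(u) = \alpha(u,\infty)$ and

(2.4)  $N^+(t) = \sum_{j=1}^{k_n} [Z_{(j)} > t]$,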




with $[A]$ denoting the indicator of the set $A$. We shall assume throughout
that $\alpha(T) > 0$ and $G(T) > 0$, where $0 < T < \infty$ is fixed.
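A direct transcription of (2.1)-(2.4) may help fix the notation. The sketch below is hedged: it follows our reading of (2.2) and (2.3), namely $B_n(t) = \{\alpha(t) + N^+(t) + (n-k_n)[Z_{(k_n)} > t]\}/\{\alpha(0)+n\}$ and $W_n(t)$ as a product of one factor per censored $Z_{(j)} \le t$ times a terminal factor when $Z_{(k_n)} \le t$. The function names and the exponential form of $\alpha(u,\infty)$ are assumptions for illustration, not part of [5].

```python
import math


def alpha_exponential(u, mass=1.0):
    """Hypothetical parameter measure: alpha(u, infinity) = mass * exp(-u)."""
    return mass * math.exp(-u)


def bayes_survival_estimate(t, pairs, n, alpha=alpha_exponential):
    """Evaluate B_n(t) * W_n(t) of (2.1).

    pairs : the first k_n ordered observations [(Z_(j), delta*_j)], with
            delta* = 1 for a true lifetime and 0 for a censoring time.
    n     : total number of units under surveillance.
    alpha : callable u -> alpha(u, infinity), assumed strictly positive.
    """
    k = len(pairs)
    z = [p[0] for p in pairs]
    z_k = z[-1]

    def n_plus(u):
        # (2.4): number of the first k_n order statistics exceeding u.
        return sum(1 for zj in z if zj > u)

    # (2.2): the n - k_n unobserved units count as survivors while t < Z_(k_n).
    b = (alpha(t) + n_plus(t) + (n - k) * (z_k > t)) / (alpha(0.0) + n)

    # (2.3): one factor for each censoring time Z_(j) <= t ...
    w = 1.0
    for zj, dj in pairs:
        if dj == 0 and zj <= t:
            base = alpha(zj) + n_plus(zj) + (n - k)
            w *= (base + 1.0) / base
    # ... and a terminal factor once t has passed the stopping time Z_(k_n).
    if z_k <= t:
        w *= (alpha(z_k) + (n - k)) / alpha(z_k)
    return b * w


if __name__ == "__main__":
    data = [(0.4, 1), (0.9, 0), (1.3, 1)]  # k_n = 3 responses out of n = 5
    for t in (0.0, 0.5, 1.0, 1.5):
        print(t, round(bayes_survival_estimate(t, data, n=5), 4))
```

With `pairs` containing all $n$ observations and no censoring, the function returns $\{\alpha(t) + N^+(t)\}/\{\alpha(0) + n\}$, which is a convenient check on the transcription.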

The estimator (2.1) may be regarded as a natural extension of an
estimator of $F(t)$ obtained by Susarla and Van Ryzin (1976) when the
complete data set $\{(Z_i,\delta_i): 1 \le i \le n\}$ is available. Indeed,
(2.1) yields this estimator on setting $k_n = n$. On the other hand, if
there are no observed censoring times in the data
$\{(Z_{(i)},\delta^*_i): 1 \le i \le k_n\}$, (2.1) reduces to

(2.5)  $\hat F_{k_n}(t) = \dfrac{\alpha(t) + N^+(t) + (n-k_n)[Z_{(k_n)} > t]}{\alpha(0) + n} \left\{ \dfrac{\alpha(Z_{(k_n)}) + (n-k_n)}{\alpha(Z_{(k_n)})} \right\}^{[Z_{(k_n)} \le t]}$

which in turn we may view as a generalization of Ferguson's (1973)
estimator $\{\alpha(t) + N^+(t)\}/\{\alpha(0) + n\}$, obtained when
censoring is absent and $k_n = n$. The Ferguson estimator reduces to the
empirical survival function (of $X_1,\ldots,X_n$) in the limit as
$\alpha(0) \to 0$, while in the same limit (2.1) for $t < Z_{(k_n)}$ and
the Susarla-Van Ryzin estimator (that is, (2.1) with $k_n = n$) yield the
product-limit estimator of Kaplan and Meier (1958), which itself reduces
to the empirical survival function in the absence of censoring. We thus
notice that several estimators of the survival curve that have found
favor in a wide variety of practical situations have a common (hitherto
unborn) progenitor: the estimator $\hat F_{k_n}$ of (2.1).
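The product-limit reduction can be verified directly. The following is a sketch under two assumptions that are ours: there are no ties among the $Z_{(j)}$, so that $N^+(Z_{(j)}) = k_n - j$, and $m(t)$ (notation introduced only for this check) denotes the number of $Z_{(j)} \le t$. Letting $\alpha \to 0$ in $B_n$ and $W_n$ for $t < Z_{(k_n)}$:

```latex
\begin{align*}
B_n(t) &\to \frac{N^+(t) + (n-k_n)}{n} = \frac{n - m(t)}{n}
        = \prod_{j:\,Z_{(j)} \le t} \frac{n-j}{n-j+1}, \\
W_n(t) &\to \prod_{j:\,Z_{(j)} \le t,\ \delta^*_j = 0} \frac{n-j+1}{n-j}, \\
B_n(t)\,W_n(t) &\to \prod_{j:\,Z_{(j)} \le t,\ \delta^*_j = 1} \frac{n-j}{n-j+1}.
\end{align*}
```

The factors belonging to censoring times cancel, leaving one factor $(n-j)/(n-j+1)$ for each observed lifetime, which is the Kaplan-Meier product-limit form.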
§3. Weak convergence of $\hat F_{k_n}$.

The strategy here is to express $n^{1/2}(\hat F_{k_n} - F)$ in a more
tractable form, in which its behavior is made transparent from the
separate behavior of $B_n$ and $W_n$. We begin with $B_n$. Define

(3.1)  $H_n(t) = n^{-1}\sum_{j=1}^{n} [Z_j > t]$,  $H(t) = P[Z_1 > t \mid F] = F(t)G(t)$.

Then from (2.4), $N^+(t) = \{nH_n(t) - (n-k_n)\}[Z_{(k_n)} > t]$, and in
conjunction with (2.2) this yields

(3.2)  $B_n(t) - H(t) = \{\alpha(0) + n\}^{-1}\{n(H_n(t) - H(t)) - nH_n(t)[Z_{(k_n)} \le t] - \alpha(0,t]\}$.

Writing $\|\cdot\|_T$ for the sup-norm on $(0,T]$ we get

(3.3)  $\|B_n - H\|_T \le \{\alpha(0) + n\}^{-1}\{n\|H_n - H\|_T + n[Z_{(k_n)} \le T] + \alpha(0,T]\}$.

In the sequel we shall repeatedly use the fact that
$\|H_n - H\|_T = o_p(n^{-\beta})$ if $\beta < 1/2$. Thus with $\beta < 1/2$
we will also have from (3.3)

(3.4)  $\|B_n - H\|_T = o_p(n^{-\beta})$

once we demonstrate

(3.5)  $P[Z_{(k_n)} \le T \mid F] = o(n^{-\beta})$, whatever $\beta$.

Now assume that the sequence of stopping numbers $\{k_n: n \ge 1\}$
satisfies

(3.6)  $\gamma \equiv \liminf n^{-1}k_n > 1 - H(T)$, with $\gamma \in (0,1]$.

Then there exists an integer $n_0$ (depending on $\gamma$ and $H(T)$) such
that $n^{-1}k_n > \tfrac12\{\gamma + 1 - H(T)\}$ whenever $n \ge n_0$. Since
the event $Z_{(k_n)} \le T$ means that $n(1 - H_n(T)) \ge k_n$, we have for
$n \ge n_0$

$P[Z_{(k_n)} \le T \mid F] \le P[(1 - H_n(T)) - (1 - H(T)) \ge \tfrac12\{\gamma - (1 - H(T))\} \mid F]$.

We bound this right-hand side probability by using the Bernstein
inequality (see Hoeffding (1963)), whence

(3.7)  $P[Z_{(k_n)} \le T \mid F] \le \exp(-\tfrac12 n\{\gamma - (1 - H(T))\}^2)$

and now (3.5) obtains, whatever $\beta$.
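The exponential bound (3.7) can be compared with the exact probability, since given $F$ the number of $Z_j \le T$ is binomial with parameters $n$ and $p = 1 - H(T)$. The sketch below is illustrative only: the function names, the values of $p$ and $\gamma$, and the rule $k_n = \lceil \gamma n \rceil$ (for which $n^{-1}k_n \ge \gamma$ for every $n$) are assumptions chosen for the example.

```python
import math


def prob_stop_before_T(n, k, p):
    """Exact P[Z_(k) <= T | F] = P[Binomial(n, p) >= k], where p = 1 - H(T)."""
    return sum(math.comb(n, m) * p**m * (1.0 - p) ** (n - m) for m in range(k, n + 1))


def bernstein_bound(n, gamma, p):
    """Right-hand side of (3.7): exp(-n * (gamma - p)^2 / 2)."""
    return math.exp(-0.5 * n * (gamma - p) ** 2)


if __name__ == "__main__":
    p, gamma = 0.3, 0.8  # assumed values with gamma > 1 - H(T) = p
    for n in (20, 50, 200):
        k = math.ceil(gamma * n)  # stopping number with k/n >= gamma
        print(n, k, prob_stop_before_T(n, k, p), bernstein_bound(n, gamma, p))
```

In such examples the exact probability is typically far smaller than the bound; all that (3.5) requires is that it decay faster than any power of $n$.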


To handle $W_n$ we have to work much harder. (In what follows we use
$f^{-1}$ to denote the reciprocal of the function $f$ rather than its
inverse.) First write (2.3) in the form

(3.8)  $W_n(t) = W_{n,1}(t)\,W_{n,2}(t)$,  $W_{n,2}(t) = \left\{ \dfrac{\alpha(Z_{(k_n)}) + (n-k_n)}{\alpha(Z_{(k_n)})} \right\}^{[Z_{(k_n)} \le t]}$.

Then since $\alpha(u)$ is nonincreasing in $u$, we obtain

$\|\ln W_{n,2}\|_T \le [Z_{(k_n)} \le T]\,\ln\{1 + (n-k_n)\alpha^{-1}(Z_{(k_n)})\} \le [Z_{(k_n)} \le T]\,n\,\alpha^{-1}(T)$,

which in turn yields, in view of (3.5),

(3.9)  $\|\ln W_{n,2}\|_T = o_p(n^{-\beta})$, whatever $\beta$.


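Now introduce

(3.10)  $\tilde H_n(t) = n^{-1}\sum_{j=1}^{n} [Z_j \le t,\ \delta_j = 0]$;  $\tilde H(t) = P[Z_1 \le t,\ \delta_1 = 0 \mid F]$.

Then from (2.3), (3.1) and (3.8) we can express

(3.11)  $\ln W_{n,1}(t) = n\int_0^{Z_{(k_n)} \wedge t} \ln\{1 + (\alpha(x) + nH_n(x))^{-1}\}\,d\tilde H_n(x)$.

On utilizing the expansion $\ln(1+x) = x + x^2\sum_{j=0}^{\infty}(-1)^{j+1}\dfrac{x^j}{j+2}$,
$0 < x < 1$, in the integrand we rewrite (3.11) as

(3.12)  $\ln W_{n,1}(t) = K_{n,1}(t) + K_{n,2}(t)$,

where $K_{n,1}(t) = n\int_0^{Z_{(k_n)} \wedge t}\{\alpha(x) + nH_n(x)\}^{-1}\,d\tilde H_n(x)$
and $K_{n,2}$ collects the terms of second and higher order in
$(\alpha(x) + nH_n(x))^{-1}$. Since both $\alpha(x)$, $H_n(x)$ are
nonincreasing in $x$, we get on simplification

$\|K_{n,2}\|_T \le \dfrac{n\tilde H_n(T)}{\{\alpha(T) + nH_n(T)\}\{\alpha(T) + nH_n(T) - 1\}}$

which in conjunction with the fact that $H_n(T) = H(T) + o_p(n^{-\beta})$
(with $\beta < 1/2$) yields

(3.13)  $\|K_{n,2}\|_T = o_p(n^{-\beta})$, whenever $\beta < 1/2$.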


Now observe that $\tilde H$ of (3.10) can also be written
$\tilde H(t) = \int_0^t F(x)\{-dG(x)\}$ and therefore, whenever
$H(t) \ne 0$, $d\{\ln G^{-1}(t)\} = H^{-1}(t)\,d\tilde H(t)$. Hence from (3.12)

(3.14)  $\begin{aligned} K_{n,1}(t) - \ln G^{-1}(t) &= n\int_0^{Z_{(k_n)} \wedge t}\{\alpha(x) + nH_n(x)\}^{-1}\,d\tilde H_n(x) - \int_0^t H^{-1}(x)\,d\tilde H(x) \\ &= \int_0^t [Z_{(k_n)} \ge x]\{n(\alpha(x) + nH_n(x))^{-1} - H^{-1}(x)\}\,d\tilde H_n(x) \\ &\quad + \int_0^t H^{-1}(x)\,d\{\tilde H_n(x) - \tilde H(x)\} - \int_0^t [Z_{(k_n)} < x]\,H^{-1}(x)\,d\tilde H_n(x) \\ &= K_{n,3}(t) + K_{n,4}(t) + K_{n,5}(t). \end{aligned}$


We first dispose of $K_{n,4}$ and $K_{n,5}$. By noting that $H^{-1}(x)$ is
nondecreasing in $x$ and performing an integration by parts, we easily obtain

$\|K_{n,4}\|_T \le 2H^{-1}(T)\|\tilde H_n - \tilde H\|_T$

and in view of the fact that $\|\tilde H_n - \tilde H\|_T = o_p(n^{-\beta})$, $\beta < 1/2$,

(3.15)  $\|K_{n,4}\|_T = o_p(n^{-\beta})$, whenever $\beta < 1/2$.

Also $\|K_{n,5}\|_T \le H^{-1}(T)\tilde H_n(T)[Z_{(k_n)} \le T]$, which gives, using (3.5),

(3.16)  $\|K_{n,5}\|_T = o_p(n^{-\beta})$, whatever $\beta$.

Finally,

$\|K_{n,3}\|_T \le \int_0^T |n\{\alpha(x) + nH_n(x)\}^{-1} - H^{-1}(x)|\,d\tilde H_n(x) \le \{(\alpha(T) + nH_n(T))H(T)\}^{-1}\tilde H_n(T)\{\alpha(0) + n\|H_n - H\|_T\}$

which leads to

(3.17)  $\|K_{n,3}\|_T = o_p(n^{-\beta})$, whenever $\beta < 1/2$.

Thus collecting our results (3.9) and (3.13) through (3.17), we finally
obtain for (3.8)

(3.18)  $\|\ln W_n - \ln G^{-1}\|_T = o_p(n^{-\beta})$, whenever $\beta < 1/2$.


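We are now in a position to express $n^{1/2}(\hat F_{k_n} - F)$ in a
governable form. Write

(3.19)  $\hat F_{k_n} - F = B_nW_n - HG^{-1} = (B_n - H)G^{-1} + (W_n - G^{-1})B_n$.

On employing the expansion $e^x = 1 + x + \tfrac12 x^2 e^{\epsilon}$, where
$\epsilon$ lies between $0$ and $x$, we can write

$W_n - G^{-1} = G^{-1}(\ln W_n - \ln G^{-1}) + \tfrac12 G^{-1}e^{\epsilon}(\ln W_n - \ln G^{-1})^2$,

and so from (3.19) we have

$\|n^{1/2}(\hat F_{k_n} - F) - n^{1/2}(B_n - H)G^{-1} - n^{1/2}(\ln W_n - \ln G^{-1})HG^{-1}\|_T$
$\le \tfrac12 n^{1/2}G^{-1}(T)\|e^{\epsilon}B_n\|_T\|\ln W_n - \ln G^{-1}\|_T^2 + n^{1/2}\|B_n - H\|_T\|\ln W_n - \ln G^{-1}\|_T\,G^{-1}(T)$,

which in view of (3.4) and (3.18) is $o_p(n^{-\beta})$, for some
$\beta < 1/2$. This reduces consideration of weak convergence of
$n^{1/2}(\hat F_{k_n} - F)$ to that of

(3.20)  $n^{1/2}(B_n - H)G^{-1} + n^{1/2}(\ln W_n - \ln G^{-1})HG^{-1}$.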


In (3.20) we can further replace $W_n$ by $W_{n,1}$ in view of (3.9). The
final simplification is obtained by writing (see (3.14)), apart from the
remainder $K_{n,2}$ already shown to be negligible in (3.13),

(3.21)  $\ln W_{n,1}(t) - \ln G^{-1}(t) = \left\{\int_0^t H_n^{-1}(x)\,d\tilde H_n(x) - \int_0^t H^{-1}(x)\,d\tilde H(x)\right\} + C_n(t)$

where $C_n(t) = -\int_0^t \dfrac{[Z_{(k_n)} \ge x]\,\alpha(x)}{H_n(x)\{\alpha(x) + nH_n(x)\}}\,d\tilde H_n(x) - \int_0^t \dfrac{[Z_{(k_n)} < x]}{H_n(x)}\,d\tilde H_n(x)$.

But $\|C_n\|_T \le \dfrac{\alpha(0)\tilde H_n(T)}{H_n(T)\{\alpha(T) + nH_n(T)\}} + [Z_{(k_n)} \le T]\dfrac{\tilde H_n(T)}{H_n(T)}$, and again
$\|C_n\|_T = o_p(n^{-\beta})$, whenever $\beta < 1/2$. Thus the weak
convergence of $\{n^{1/2}(\hat F_{k_n}(t) - F(t)): t \in [0,T]\}$ is
tantamount to that of

(3.22)  $n^{1/2}\{B_n(t) - H(t)\}G^{-1}(t) + n^{1/2}F(t)\left\{\int_0^t H_n^{-1}\,d\tilde H_n - \int_0^t H^{-1}\,d\tilde H\right\}$

which is precisely the final reduction obtained by Susarla and Van Ryzin
(1978). We must however emphasize here that the interval endpoint $T$
is restricted through (3.6). As noted earlier, the estimator $\hat F_{k_n}$
reduces to that studied by Susarla and Van Ryzin (1976, 1978) when
$k_n = n$, in which situation (3.6) degenerates to the condition $H(T) > 0$
that they imposed.

The analysis of the process (3.22) has been dealt with in Susarla
and Van Ryzin (1978) and Breslow and Crowley (1974). We therefore state
our results for $\hat F_{k_n}$ in the following

Theorem: Suppose $\hat F_{k_n}$ is defined by (2.1)-(2.4) and the sequence
$\{k_n: n \ge 1\}$ satisfies (3.6). Assume $F(t)$, $G(t)$ continuous on
$[0,T]$. Then the process $\{n^{1/2}(\hat F_{k_n}(t) - F(t)): t \in [0,T]\}$
converges weakly to a Gaussian process

$U(t) = -G^{-1}(t)P(t) + F(t)\left\{\int_0^t H^{-2}P\,d\tilde H + H^{-1}(t)Q(t) + \int_0^t H^{-2}Q\,dH\right\}$

where $P$, $Q$ are themselves mean zero Gaussian processes with a
covariance structure given, for $s \le t$, by

$\mathrm{Cov}(P(s), P(t)) = H(t)(1 - H(s))$
$\mathrm{Cov}(Q(s), Q(t)) = \tilde H(s)(1 - \tilde H(t))$
$\mathrm{Cov}(P(s), Q(t)) = \tilde H(s) - \tilde H(t)(1 - H(s))$
$\mathrm{Cov}(Q(s), P(t)) = \tilde H(s)H(t)$.

Indeed $U$ has mean zero and covariance given, for $s \le t$, by

$\mathrm{Cov}(U(s), U(t)) = -F(s)F(t)\int_0^s H^{-1}F^{-1}\,dF$.
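The limiting variance $\mathrm{Var}\,U(t) = -F^2(t)\int_0^t H^{-1}F^{-1}\,dF$ is easy to evaluate in simple models. The sketch below assumes, purely for illustration, exponential survival functions $F(x) = e^{-\lambda x}$ and $G(x) = e^{-\mu x}$ (these choices, the function names and the trapezoidal rule are ours), and compares numerical integration with the closed form that these choices admit.

```python
import math


def asymptotic_variance(t, lam, mu, steps=20000):
    """Var U(t) = -F(t)^2 * int_0^t H^{-1} F^{-1} dF by the trapezoidal rule.

    Assumes F(x) = exp(-lam*x), G(x) = exp(-mu*x) and H = F*G, so that
    -H^{-1} F^{-1} dF = lam * exp((lam + mu) * x) dx.
    """
    def integrand(x):
        return lam * math.exp((lam + mu) * x)

    h = t / steps
    total = 0.5 * (integrand(0.0) + integrand(t))
    for i in range(1, steps):
        total += integrand(i * h)
    return math.exp(-2.0 * lam * t) * total * h


def asymptotic_variance_closed_form(t, lam, mu):
    """Closed form of the same integral under the exponential assumptions."""
    return math.exp(-2.0 * lam * t) * lam / (lam + mu) * (math.exp((lam + mu) * t) - 1.0)


if __name__ == "__main__":
    for mu in (0.0, 0.5, 1.0):
        print(mu, asymptotic_variance(1.0, 1.0, mu), asymptotic_variance_closed_form(1.0, 1.0, mu))
```

With $\mu = 0$ (no censoring) the variance reduces to $F(t)(1 - F(t))$, the familiar binomial variance of the empirical survival function, and it increases with the censoring rate $\mu$.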
§4. Concluding Remarks.

In this paper we have assumed the censoring variables $Y_1,\ldots,Y_n$
to be iid. Under appropriate conditions our results can be extended to
the case where the $Y_i$'s have different distributions. In addition
to the weak convergence result for $\{\hat F_{k_n}: n \ge 1\}$ one can
demonstrate the uniform strong consistency and uniform mean-square
consistency of $\{\hat F_{k_n}: n \ge 1\}$ as an estimator of $F$ on $[0,T]$.

Finally, confidence bands for $\{F(t): t \in [0,T]\}$ can be calculated
in terms of $\{\hat F_{k_n}(t): t \in [0,T]\}$ using the distribution of
the limiting process $\{U(t): t \in [0,T]\}$ of our Theorem. Note that the
process $\{n^{1/2}(\hat F_{k_n}(t) - F(t))/F(t): t \in [0,T]\}$ converges
weakly to a time-transformed Brownian motion.
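The last remark follows from the covariance of $U$ in the Theorem. As a brief sketch, with $v$ and $W$ being notation introduced only for this remark ($W$ a standard Brownian motion):

```latex
\begin{align*}
v(t) &= -\int_0^t H^{-1}F^{-1}\,dF , \qquad v(0) = 0, \quad v \text{ nondecreasing on } [0,T], \\
\mathrm{Cov}\!\left(\frac{U(s)}{F(s)}, \frac{U(t)}{F(t)}\right)
     &= v(s) = v(s) \wedge v(t), \qquad s \le t, \\
\left\{\frac{U(t)}{F(t)} : t \in [0,T]\right\}
     &\overset{d}{=} \{\, W(v(t)) : t \in [0,T] \,\}.
\end{align*}
```

Since both processes are mean zero Gaussian with the same covariance, probabilities for the supremum of $|U(t)/F(t)|$ over $[0,T]$ reduce to those for Brownian motion on $[0, v(T)]$, which is what makes the construction of confidence bands tractable.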